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-- The MAILING DATE of this communication appears on the cover sheet with the correspondence address- 
All claims being allowable, PROSECUTION ON THE MERITS IS (OR REMAINS) CLOSED in this application. If not included 
herewith (or previously mailed), a Notice of Allowance (PTOL-85) or other appropriate communication will be mailed in due course. THIS 
NOTICE OF ALLOWABILITY IS NOT A GRANT OF PATENT RIGHTS. This application is subject to withdrawal from issue at the initiative 
of the Office or upon petition by the applicant. See 37 CFR 1 .31 3 and MPEP 1 308. 

1 . ^ This communication is responsive to Appeal Brief fifed 19 November 2007 . 

2. [El The allowed claim(s) is/are 8. 11-15. 17-23, 26-29 (renumbered 1-17) . 

3. □ Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 1 19(a)-(d) or (f). 

a) □ All b) □ Some* c) □ None of the: 

1. □ Certified copies of the priority documents have been received. 

2. □ Certified copies of the priority documents have been received in Application No. . 

3. □ Copies of the certified copies of the priority documents have been received in this national stage application from the 

International Bureau (PCT Rule 17.2(a)). 
* Certified copies not received: . 

Applicant has THREE MONTHS FROM THE "MAILING DATE" of this communication to file a reply complying with the requirements 
noted below. Failure to timely comply will result in ABANDONMENT of this application. 
THIS THREE-MONTH PERIOD IS NOT EXTENDABLE. 

4. □ A SUBSTITUTE OATH OR DECLARATION must be submitted. Note the attached EXAMINER'S AMENDMENT or NOTICE OF 

INFORMAL PATENT APPLICATION (PTO-152) which gives reason(s) why the oath or declaration is deficient. 

5. □ CORRECTED DRAWINGS ( as "replacement sheets") must be submitted. 

(a) □ including changes required by the Notice of Draftsperson's Patent Drawing Review ( PTO-948) attached 

1) □ hereto or 2) □ to Paper No./Mail Date . 

(b) □ Including changes required by the attached Examiner's Amendment / Comment or in the Office action of 

Paper No./Mail Date . 

Identifying indicia such as the application number (see 37 CFR 1, 84(c)) should be written on the drawings in the front (not the back) of 
each sheet. Replacement sheet(s) should be labeled as such in the header according to 37 CFR 1.121(d). 

6. □ DEPOSIT OF and/or INFORMATION about the deposit of BIOLOGICAL MATERIAL must be submitted. Note the 

attached Examiner's comment regarding REQUIREMENT FOR THE DEPOSIT OF BIOLOGICAL MATERIAL. 
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EXAMINER'S AMENDMENT 

1 . An examiner's amendment to the record appears below. Should the changes 
and/or additions be unacceptable to applicant, an amendment may be filed as provided 
by 37 CFR 1 .312. To ensure consideration of such an amendment, it MUST be 
submitted no later than the payment of the issue fee. 

Authorization for this examiner's amendment was given in a telephone interview 
with Pam Riley on January 7, 2008. 

The application has been amended as follows (NOTE: only amended claims are 

listed): 

8. A system for clustering documents in datasets comprising: 

a storage having a first dataset and a second dataset; 

a cluster generator operative to cluster first documents in said first dataset and 
produce first document classes; 

a centroid seed generator operative to generate centroid seeds based on said 
first document classes; 

a dictionary generator adapted to generate a first dictionary of most common 
words in said first dataset; and 

a vector space model generator adapted to generate a first vector space model 
by counting, for each word in said first dictionary, a number of said first documents in 
which said word occurs, 
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wherein said cluster generator clusters said documents in said first dataset 
based on said first vector space model, 

wherein said cluster generator clusters second documents in said second 
dataset using said centroid seeds, such that said second dataset has a similar based on 
said centroid seeds, clustering to that of said first dataset, and 

wherein said second dataset comprises a new, but related, based on said 
centroid seeds, dataset different than said first dataset. 

1 1 . The system in claim [1 0] 8, wherein said vector space model generator 
generates a second vector space model by counting, for each word in said first 
dictionary, a number of said second documents in which said word occurs. 

17. The method in claim [16] 15, wherein said generating of said second vector 
space model comprises counting, for each word in said first dictionary, a number of said 
documents in which said word occurs. 

26. A program device readable by machine, tangibly embodying a program of 
instructions executable by the machine to perform said method in claim [25] 24, said 
method further comprising generating a second vector space model by counting, for 
each word in said first dictionary, a number of said second documents in which said 
word occurs. 
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In the Specification, page 8, lines 15-17: 

Then, the vector space model generator 210 counts, .for each word in the first 
dictionary D1 206, the number of documents in which the word in question appears, to 
produce a T1-D1 vector space [mode] model 202. 

2. The following is an examiner's statement of reasons for allowance: The prior art 
of record fails to disclose clustering a second dataset of documents based upon 
centroid seeds generated based upon a first dataset of documents. 

Any comments considered necessary by applicant must be submitted no later 
than the payment of the issue fee and, to avoid processing delays, should preferably 
accompany the issue fee. Such submissions should be clearly labeled "Comments on 
Statement of Reasons for Allowance." 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Kyle R. Stork whose telephone number is (571) 272- 
4130. The examiner can normally be reached on Monday-Friday (8:00-4:30). 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Stephen Hong can be reached on (571) 272-4124. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the 



Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
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